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Description 



[0001] The possibilities of preparing hybrid genes by gene technology open up new routes for the worklng-up of 
recombinant proteins. By linidng the coding gene sequence of a desired protein with the coding gene sequence of a 

5 protein fragment having a high affinity for a ligand (affinity peptide) it is possible to purify desired recombinant proteins 
in the form of fusion proteins in one step using the affinity peptide. By site-directed mutagenesis It is also possible to 
introduce specific chemical or enzymatic cleavage sites at the point of linkage of the affinity peptide and the desired 
recombinant protein, so that after the purification of the fusion protein by means of a suitable affinity resin the desired 
recombinant protein can be recovered by chemical or enzymatic cleavage. 

10 [0002] However, recovering of the desired recombinant protein may turn out to be extremely difficult when the desired 
recombinant protein also contains such chemical or enzymatic cleavage sites in its amino acid sequence. In such 
cases, the desired recombinant protein ends up being rapidly degraded. 

[0003] In order to inhibit this degradation, the present invention provides fusion proteins and processes which allow 
the selective cleavage at a specific chemical or enzymatic cleavage site without affecting the desired recombinant 
IS protein. The methods of the present invention are specifically applicable for production of hydrophobic polypeptides, 
proteins or peptides. 

[0004] In more detail, the present Invention is concerned with fusion proteins of the fomiula: 



wherein A is a bulky hydrophilic peptide, B is a selected cleavage site, and C is a desired hydrophobic polypeptide, 

protein or peptide. 

[0005] The temn "bulky hydrophilic peptide" which is used in connection with the fusion proteins in accordance with 
25 the invention relates to hydrophilic peptides which are characterized by their size and the content of hydrophilic amino 
acids giving rise to a well structured domain. 

[0006] The bulky hydrophilic peptide A of the fusion proteins in accordance with the invention serves a dual function: 
a) to facilitate high expression of the fusion proteins and b) to expose the cleavage site C to the mobile phase on a 
hydrophobic matrix column. Prefen-ed bulky hydrophilic peptides of the fusion proteins in accordance with the invention 
30 are those with the peptide sequence of the formula 

(NANP)x 

35 Wherein x is 1 0 > 40 and 1 9 being most preferred. 

[0007] As selected cleavage sites there come into consideration chemical or enzymatic cleavage sites. As suitable 
selected enzymatic cleavage sites there come into consideration the amino acid sequences -(Asp)p-Lys-, wherein n 
signifies 2, 3 or 4. or -Me-Glu-Gly-Arg- which can be specifically recognized by the proteases enterokinase and coag- 
ulation factor Xg, respectively, an arginine residue or a lysine residue cleaved by trypsin, a lysine residue cleaved by 

40 lysyl endopeptidase or a gtutamine residue cleaved by V8 protease. As suitable selected chemical cleavage sites there 
come into consideration tryptophan residues cleaved by 3-bromo-3-methyl-2- (2-nitrophenylmercapto)-3H-indole, 
cystein residues cleaved by 2-nitroso-5-thiocyano benzoic acid, the amino acid dipeptldes Asp-Pro or Asn-Gly which 
can be cleaved by acid and hydroxylamine, respectively, and preferably, a methionine residue which is specifically 
cleaved by cyanogen bromide (CNBr). 

^ [0008] The tenri "hydrophobic polypeptide, protein or peptide" which is used in connection with the fusion proteins 
in accordance with the invention relates to hydrophobic polypeptides, proteins or peptides which elute from reversed 
phase HPLC columns at concentrations between 30 and 60%, preferably around 40% of organic solvents in aqueous 
buffer, e.g. at a concentration of higher than 40% ethanot In aqueous buffer. 

[0009] As hydrophobic polypeptides, proteins or peptides there come into consideration, for example, surface anti- 
50 gens, lymphokine receptors, HlV-1 and HWZ-Z envelope and structure proteins, hepatitis C envelope and structure 
proteins or any peptide with membrane anchor sequences. A preferred hydrophobic peptide is the peptide having the 
sequence: 



lOL^VEICaXDQQL^ (SeqID No:ll) 
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The bulky hydrophiltc peptides and the selected cleavage sites of the fusion proteins in accordance with the invention 
can be lini<ed either to the amino terminal amino acid or to the carboxy temiinal amino acid of the hydrophobic polypep- 
tide, protein or peptide. 

[0010] The fusion proteins in accordance with the present invention can optionally. contain specific sequences that 
s preferably bind to an affinity canrier material. Examples of such sequences are sequences containing at least two 
adjacent histidine residues (see in this respect European Patent Application Publication No. 282 042). Such sequences 
bind selectively to nitrilotriacetic acid nickel chelate resins (Hochuli and Dobeli, Biol.Chem. Hoppe-Seyler 368, 748 
(1 987); European Patent No. 253 303). Fusion proteins of the present invention which contain such a specific sequence 
can, therefore, be separated selectively from the remaining polypeptides. The specific sequence can be linked either 
10 to the amino acid sequence of the bulky hydrophilic peptide or the amino acid sequence of the hydrophobic polypeptide, 
protein or peptide. 

[0011] The present invention is also concemed with genes which code for these fusion proteins, expression vectors 
which contain these genes, microorganisms transformed with these expression vectors as well as a process for the 
preparation of said genes, expression vectors and transfomied microorganisms. 
IS [001 2] The preparation of the fusion proteins in accordance with the invention can be effected according to methods 
of recombinant DNA technology which are described in the literature. Preferably, a nucleotide sequence coding for the 
desired hydrophobic polypeptide, protein or peptide is firstly synthesized and this is then linked with a nucleotide se- 
quence coding for the bulky hydrophilic peptide and the selected cleavage site. 

[0013] The incorporation of the thus-obtained hybrid gene in expression vectors is also effected in a manner known 
20 per se. In this context reference can be made to the textbooks of Maniatis et al. ("Molecular Cloning", Cold Spring 
Harbor Laboratory, 1 982) and Sambrook et al. ("Molecular Cloning-A Laboratory Manual", 2nd. ed., Cold Spring Hari3or 
Laboratory, 1989). 

[0014] The methods for the expression of the fusion proteins in accordance with the invention are also known per 
se and are described in detail In the aforementioned textbooks. They embrace the following procedures: 

25 

a) Transfonnation of a suitable host organism, advantageously E.coli, with an expression vector in which an afore- 
mentioned hybrid gene is operatlvely bonded to an expression control sequence; 

b) cultivation of the thus-obtained host organism under suitable growth conditions; and 

30 

c) extraction and Isolation of the desired fusion protein from the host organism. 

[0015] As host organisms there come into consideration gram-negative and gram-positive bacteria, for example E. 
coll and B.subtilis strains. E.coli strain Ml 5 Is an especially preferred host organism of the present Invention. Apart 

35 from the above-mentioned E.coli strain there can, however, also be used other generally accessible E. coll strains, for 
example E. coli 294 (ATCC No. 3144), E.coli RR1 (ATCC No. 31343) and E. coli W3110 (ATCC No. 27325). 
[001 8] The fusion proteins in accordance with the present invention allow the selected cleavage at a specific chemical 
or enzymatic cleavage site without affecting the desired hydrophobic polypeptide, protein or peptide. Diffusion of the 
desired hydrophobic polypeptide, protein or peptide into the solid phase of a hydrophobic matrix column enables to 

40 orient the fusion proteins in accordance with the present invention so as to hide the desired hydrophobic polypeptide, 
protein or peptide. The bulky hydrophilic peptide on the other hand exposes the selected cleavage site to the mobile 
aqueous phase. This allows one to remove the bulky hydrophilic peptide by selected cleavage leaving only the desired 
hydrophobic polypeptide, protein or peptide bound to the column. The desired hydrophobic polypeptide, protein or 
peptide can then be eluted by addition of organic solvents. 

^ [0017] Hence, the present invention also provides a process allowing the production and purifteatlon of a desired 
hydrophobic polypeptide protein or peptide, which process comprises the steps of: 

a) passing an aqueous solution containing a fusion protein in accordance with the present invention through a 
hydrophobic matrix column, 

so 

b) flushing the column with a solution containing a cleavage reagent or an enzyme, and 

c) removing the resulting desired hydrophobic polypeptide, protein or peptide with an aqueous water miscible 
solvent. 

55 

[0018] As hydrophobic matrix columns there come Into consideration cyanopropyl, cyclohexyi, phenyl, octyl or oc- 
tadecyl group bonded silica matrix columns. In the preferred practice of the invention RP - 16 (octadecyl bound silica 
microparticle column) under reversed phase high perfomnance liquid chromatography (IHPLC) conditions is used. 
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[0019] Prior to the loading with the fusion protein in accordance with the invention, the hydrophobic matrix column 
is conveniently equilibrated with an aqueous buffer. The equilibration buffer can contain a denaturing agent or a cha* 
otropic agent, for example guanidine-HCL, urea or a detergent, e.g. Triton. The addition of such a denaturing agent, 
chaotropic agent or detergent permits problem-free operations even with fusion proteins in accordance with the inven- 
5 tion which are extremely difficult to solublize in aqueous solution. 

[0020] The fusion protein in accordance with the present Invention is applied onto the hydrophobic matrix column In 
aqueous buffer which can also contain a denaturing agent or a detergent, for example guanidine-HCL, urea or Triton. 
[0021] Cleavage Is perfonned by flushing the column with an aqueous buffer containing a cleavage reagent or an 
enzyme. The optimal buffer composition depends on the cleavage reagent or enzyme used and Is conveniently deter- 
ge mined on a case-by-case basis. 

[0022] The elution of the desired hydrophobic polypeptides, proteins or peptides can be carried out using a gradient 
of an aqueous water miscible solvent. Suitable water miscible solvents for this purpose include alkanols such as n- 
propanol, 2-propanol, ethanol, methanol, tert- butanol or cyclic ethers such as dioxane. The optimal elution conditions 
depend on the desired hydrophobic polypeptide, protein or peptide to be purified, the hydrophobic matrix, the column 
IS dimensions etc. and are conveniently detemiined on a case-by-case basis. 

[0023] The aforementioned process allowing the production and purification of a desired hydrophobic polypeptide 
protein or peptide can also be carried out batch-wise. The fusion protein in accordance with the present invention is 
then absorbed to a hydrophobic matrix in aqueous buffer. Cleavage is perfomied by incubating the hydrophobic matrix 
with an aqueous buffer containing the cleavage reagent or the enzyme. The desired hydrophobic polypeptide can be 
20 obtained by incubating the hydrophobic matrix with the aqueous water miscible solvent after removal of the cleavage 
reagent or enzyme and the bulky hydrophilic peptides. 

[0024] The novel process allowing production and purification of a desired hydrophobic polypeptide, protein or pep- 
tide is also employed to purify HIV-1 envelope peptide with Seq ID No:11 to homogeneity using analogous conditions 
as those used in reference Example 1 . 
25 [0025] The peptide with Seq ID No: 11 obtained by the novel process may be used in diagnosing HIV Infections. 
[0026] These Examples can be understood better when they are read in conjunction with the accompanying Figures. 
The following symbols appear in these Figures: 

'N250PSN250P29' represents the regulatable promoter/operator element N250PSN250P29, 'RBSII' represents 
30 the synthetic ribosomal binding site RBSII; '[Hisje*, '[NANP]19' and 'amy' represent the genes encoding the 

6xHis-NANP-amyloid fusion proteins of this invention; 'bla', 'oaf, 'lad' and 'neo' represent the genes for beta- 
lactamase, chloramphenicol acetyltransf erase, lac repressor and neomycin phosphotransferase, respectively; 1o',. 
TE' and T1 ' represent transcriptional tenninators of phage lambda, TE of phage T7 and T1 of the E. coli rmB 
operon; 'repl.' represents the replication regions of plasmids pBR322 and pREP4. 

35 

Figure 1 

is a schematic drawing of the plasmid pREP4. 
Figure 2 

40 is a schematic drawing of the plasmid p6xHis-NANP-Met-Amy. 

Figure 3 

displays that part of the nucleotide sequence of plasmid p6xHis-NANP-Met-Amy (Seq ID No: 7) which encodes 
the fusion protein 6xHis-NANP-l\/1et-Amy (Seq ID No: 8). In this sequence, the recognition sequences of some 
45 of the restriction enzymes depicted In Figure 2 are indicated. The amino acid sequence shown represents In 

the three tetter code the sequence of the fusion protein 6xHls-NANP-Met-Amy, amino acids con^esponding to 
the bA4-peptlde are numbered. 

Figure 4 

50 is a schematic drawing of the plasmid pB/EI -6xHis-NANP-Met-huAmy. 

Figure 5 

displays that part of the nucleotide sequence of plasmid pB/E1 -6xHis-NANP-Met-huAmy (Seq ID No: 9) which 
encodes the fusion protein 6xHis-NANP-Met-huAmy (Seq ID No: 10). In this sequence, the recognition se- 
55 quences of some of the restriction enzymes depicted in Figure 4 are indicated. The amino acid sequence 

shown represents in the three letter code the sequence of the fusion protein 6xHis-NANP-Met-huAmy, amino 
acids corresponding to the bA4-peptlde are numbered. The lower part of the Figure displays the nucleotide 
sequences and the encoded amino acids by which plasmid pB/E1-6xHls-NANP-Met-huAmy[M35E], which 
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encodes the fusion protein 6xHis-NANP-Met-huAmy[M35E], plasmld pB/E1 -6xHis-NANP-Met-huAmy[M35L], 
which encodes the fusion protein 6xHis-NANP-Met-huAmy[M35L], plasmid pB/E1-6xHis-NANP-Met-huAmy 
[M35Q1, which encodes the fusion protein 6xHis-NANP-Met-huAmy[M35Q], and plasmid pB/ 
E1 -6xHls-NANP-l\^et-huAmy[M35S]. which encodes the fusion protein 6xHis-NANP-Met-huAmy[M35S], differ 
s from plasmid pB/E1 -6xHis-NANP-Met-huAmy. 

Figure 6 

gives the resuits of bA4 analysis by non-denaturing agarose gel electropheresis. This analysis was performed 
with "Serum Protein Electrophoresis system Paragon" from Beckman according to the recommendations of 
10 the supplier. 8 fig (in 2 p.1 H2O) of peptide each were applied. Staining was with Coomassle brilliant blue for 3 

h and destatning with 10% acetic acid, 45% methanole and 45% water. Samples were prepared In destilted 
water immediately before the experiment (f) or allowed to age for 2 days (a). The sampler tested are given in 
the list below. 



25 



gel a 




gelb 




lanel 


human bA4 wash 1 f 


lane 1 


M35S bA4 wash 1 f 


lane 2 


human bA4 wash 2 f 


lane 2 


M35S bA4wash2f 


lane 3 


human bA4 wash 3 f 


Jane 3 


M35S bA4 wash 3 f 


lane 4 


human bA4 wash 4 f 


lane 4 


M35S bA4wash4 .f 


lane 5 


human bA4 wash 4 a 


lane 5 


M35S bA4 wash 4 a 


lane 6 


Standard BSA2 mg 


lane 6 


Standard BSA 10mg 


lane 7 


M35L bA4 wash 1 f 


lane 7 


M35Q bA4 wash 1 f 


lane 8 


M35L bA4 wash 2 f 


lane 8 


M35Q bA4wash2f 


lane 9 


M35L bA4 wash 3 f 


lane 9 


M35Q bA4 wash 3 f 


lane 1 0 


M35L bA4 wash 3a 


lane 10 


rat bA4 


geic 









gives the results of a comparison of the point mutants M35Q bA4 and M35E bA4. The point mutant M35E bA4 
contains an extra negative charge which results in a higher mobility on the Beclcman gel (lane 8: MSSE bA4; 
lane 9: M35Q bA4). When M3SE bA4 is mixed with M35Q bA4 after cleavage (lane 1) or when the fusion 
protein containing M35Q bA4 is mixed with the fusion protein containing M35E bA4 and cleaved according to 
the process of the present invention (lanes 3, 4, 5 and 6 (wash 1 , 2, 3 and 4)) a clear separation is observed 
indicating the presence of monomeric bA4. 

Figure 7 

gives the results of bA4 chromatography on a size-exclusion column. 

[0027] Fractions of 250 ml each were collected and analyzed by non-denaturing electrophoresis. Lane 1 : bA4 applied 

onto the column. Lanes 2 to 1 0: peal< fractions. 

[0028] The mari<er proteins used to determine the size of bA4 were : 



serum albumin 


(MW = 


= 65000 ; retention time 


= 19.89 min) , 


ovalbumine 


(MW = 


= 45000 ; retention time 


= 20.10 min) , 


lactalbumin 


(MW = 


= 14200 ; retention time 


^ 21.43 min) and 


insulin 


(MW = 


= 5734 ; retention time = 


21 .68 min) . 



so [0029] The retention time of 22.54 min. points to a monomer with a M W of about 4500 daltons. This Is in agreement 
with light-scattering data and ultracentrifugation experiments by the YphantIs method (D.A. Yphantls. Annals of the N. 
Y. Acad.Sci. 88, 586-601 [1 960]). 

Example 1 

55 

Expression plasmid used for the preparation of the fusion protein 6xHis-NANP-Met-Amy 

[0030] The expression plasmid p6xHis-NANP-Met-Amy (see Figures 2 and 3) was used for the preparation of the 
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fusion protein 6xHis-NANP-Met-Amy. E. coli M15 cells transformed with plasmlds pREP4 and p6xHls-NANP-Met-Anny 
were deposited in accordance with the Budapest Treaty at the Deutsche Sammlung von Mikroorganismen (DSM) In 
Braunschweig. BRD, on May 18, 1993, under the accession number DSM 8310. 

5 Example 2 

Expression plasmid used for the preparation of the fusion protein exHIs-NANP-Met-huAmy 

[0031] The expression plasmid pB/E1-6xHis-NANP-Met-huAmy (see Figures 4 and 5) was used for the preparation 
10 of the fusion protein 6xHis-NANP-Met'huAmy. E. coti M15 cells transformed with plasmlds pREP4 and pB/ 
E1-6xHls-NANP-Met-huAmy were deposited in accordance with the Budapest Treaty at the Deutsche Sammlung von 
Mikroorganismen (DSM) In Braunschweig, BRD, on May 18, 1993, under the accession number DSM 8311 . 

Expression plasmids used for the preparation of the fusion proteins 6xiHis-NANP-Met-huAmy[M35L], BxHIs-NANP-Met- 
15 huAmyfMSSQl 6xHis>NANP'Met-huAmy[M35Sl. and 6xHIS"NANP>Met-huAmy [M35E1 

[0032] The expression plasmlds pB/E1-6xHls-NANP-Met-huAmy[M35L]. pB/E1-6xHis-NANP-Met-huAmy[M35Q], 
pB/E1-6xHis-NANP-Met-huAmy[M35S] and pB/E1-6xHis-NANP-Met-huAmy[M35E]. which differ from plasmid pB/ 
E1-6xHis-NANP-Met-huAmy only In the nucleotides encoding amino acid 35 of the bA4 amyloid peptide (see Figure 

20 5), were used for the preparation of the fusion proteins 6xHis-NANP-Met-huAmy[M35L],6xHls-NANP-Met-huAmy 
[M35Q], 6xHls-NANP-Met-huAmy[M35S] and 6xHis-NANP-Met-huAmy[M35E], respectively. E. coli M15 cells trans- 
formed with plasmids pREP4 and pB/EI -6xHls-NANP-Met-huAmy[M35L], pf=lEP4 and pB/EI -6xHls-NANP-Met-huAmy 
[M35Q], pREP4 and pB/EI -6xHis-NANP-Met-huAmy[M35S] and pREP4 and pB/EI -6xHis-NANP-Met-huAniy[M35E]], 
respectively, were deposited In accordance with the Budapest Treaty at the Deutsche Sammlung von Mikroorganismen 

25 (DSM) in Braunschweig, BRD, on May 18, 1993, under the accession numbers DSM 8313, DSM 8314. DSM 8315 and 
DSM 8312, respectively. 

Example 4 

30 Femientation and purification of fusion proteins 
Femnentation 

[0033] Plasmids p6xHls-NANP-Met-Amy, pB/E1 -6xHls-NANP-Met-huAmy, pB/E1 -6xHis-NANP-Met-huAmy [M35 L], 
35 pB/E1 -6xHls-NANP-Met-hu AmytM35Q] pB/EI -6xHls-NANP-Met-huAmy [M35S] and pB/EI -SxHIs-NANP-Met-huAmy 
[M35E], respectively were transformed into E. coli Ml 5 cells already containing plasmid pREP4 by standard methods 
(Sambrook et al., supra). Transfonned cells were grown at 37"C In a 100 1 femienter in Super medium [Stiiber et al., 
Immunological Methods, eds. Lefkovlts and Pernis, Academic Press, Inc.. Vol.lV, 12M52 [1990]] containing 100 mg/ 
I amplcillln and 25 mg/l kanamycin. At an optical density at 600 nm of about 1 .0 IPTG was added to a final concentration 
40 of 2 mM. After an additional 3 hrs at 37°C the cells were harvested by centrifugation. In a typical fermentation run a 
biomass of approximately 500 g containing at least 3 g of the recombinant fusion protein was obtained. 

Purification 

45 [0034] 2.5 1 of 6 M guanidine-HCI containing 0. 1 M di-sodium hydrogen phosphate, pH 8 were added to the cells and 
stirred for 24 hours. Crude cell debris were removed by centrifugation and the supernatant was then further clarified 
by cross-flow filtration using a 0.3 mm membrane. The protein contained in the filtrate was then adsortDed to a Ni-NTA 
column (5 cm x 24 cm, flow 20 ml/min). Contaminating E.co// proteins were removed by washing first with 8M urea, 
pH 7.5. Elution was performed with 8 M urea, pH 4 . The chromatogram was monitored by SDS-PAGE and fractions 

50 containing fusion protein were pooled. A small aliquot of that pool was mixed with EDTA and desalted by dialysing 
against water, lyophiilzed and then analysed by electron spray mass-spectrometry (Table 1). 



55 
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Table 1 : 



Characterization of the fusion proteins 


Fusion Protein 


Purification yield per 1 00 1 


Theoretical mass 


Average mass by electron spray 




femnenter [gram] 




MS 


Human WT 


3.0 


13817 


13820 


Human Mut. M35S 


5.5 


13773 


13776 


Human Mut. M35L 


5.7 


13799 


13802 


Human Mut M35Q 


4.5 


13814 


13817 


Human Mut. M35E 


6.0 


13813 


not tested 


RatWT 


6.0 


13724 


13728 


WT = wild-type, Mut = mutant, e.g. M35S (Met at position 35 Is mutated to Ser) 



15 : 
Reference Example 1 

Cleavage of the fusion proteins to yield 1-42 b-amyloid peptides 

20 [0035] A semlpreparative RP-18 HPLC (Vidac, specification 218TP 152010, 250 mm x 10 mm) column was first 
equilibrated with 8 M urea, pH 4, at a flow rate of 2 ml/mln. Then an aliquot of the NTA-eluate containing 400 mg fusion 
protein In 8 M urea, pH 4, was pumped onto the column at a flow rate of 1 ml/min. Then the column was washed with 
8 M urea, pH 4, at a flow rate of 2 ml/min. The urea was washed out by water at a flow rate of 2 ml/min until baseline 
adsorbance at the column outlet was reached. Cleavage was perfomied by flushing the column with 45 mg/ml CNBr 

25 in a solution composed of 20% ethanol, 40% fonnic acid and 40% water for 24 hrs at 22 °C at a flow rate of 0.6 ml/ 
min. The column was then flushed with 0.1 M EDTA at a flow rate of 2 ml/min and CNBr together with liberated MRG- 
SHHHHHHGS-(NANP)ig-RSM was washed out with 0.05%trifiuoro acetic acid at afiow rate of 2.0 ml/mln. 1 -42 residue 
b-amyloid peptide was eluted at a flow rate of 2 ml/mln using the following ethanol gradient given by the time points : 
(min/% ethanol) 0/0, 40/40. 45/50, 50/65, 55/1 00, 60/1 00, 65/0 

30 [0036] A broad peak containing the b-amylold peptide emerges between 45 and 60 min. Critical for the peptide to 
be monomeric was the immediate dilution with distilled water (e.g. by dropping the eluate into a stirred beakercontaining 
200 ml H2O), and immediate lyophiiization. The resulting powder was named W1 . Since a considerable amount of b- 
amyloid peptide remained on the column, the elutlon was repeated three times using the above mentioned protocol, 
giving rise to samples W2, W3 and W4. These samples were tested for purity (Table 2) and the amont of monomeric 

35 bA4 [Figure 6). The correct chemical structure of the peptides was verified by electron spray mass-spectrometry (Table 
3), by amino acid analysis and by amino temnlnai sequencing (Table 3). 



Table 2: 



Production of 1-42 b-amylold peptides 


Peptide 


Number of tests 


Wash Number 


Mass produced -(mg) 


Purity (%) 


rat WT (Seq ID No:2) 


4 


1 


100±14 


90 ±2 






2 


28±8 


89 ±2 






3/4 


9±3 


90±1 


human WT (Seq ID No:1) 


3 


1 


96 ± 23 


79 ±9 






2 


52 ±10 


73 ±5 






3 


32±9 


78 ±4 






4 


15±8 


86 ±5 



55 
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Table 2 : (continued) 





Production of 1 -42 b-amyloid peptides 


Peptide 


Number of tests 


Wash Number 


Mass produced (mg) 


Purity (%) 


5 


human M35S (Seq ID No:4) 


3 


1 


71 ± 13 


80 ± 8 








2 


31 ±5 


74 ±2 








3 


18±7 


83 ±2 


10 






4 


2±7 


88 ±2 




human M35L (Seq ID No:3) 


3 


1 


59±15 


72 ±8 








2 


23 ±9 


71 ±4 


15 






3 


8±3 


78 ±3 




human M35Q (Seq ID No:5) 


4 


1 


37±4 


70 ±2 








2 


10±5 


82 ±3 


20 














human M35E (Seq D No:6) 


1 


1 


95 


not tested 








2 


68 


not tested 








3 


46 


not tested 


25 






4 


31 


not tested 



[0037] Purity Is based on amino acid analysis, the content of fusion protein Is detected by the quotient of Asp to Glu. 
Pure 1-42 b-amylold peptide has 4 Glu and 4 Asp residues, pure fusion protein has a ratio of 42 Asp to 4 Gtu. 
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Identification of 1 -42 b-amyloid peptides 


Sample 


Mass Spectrometry 


Edman degradation 


Theory 


Found 


1 0-1 5 cycles 


human WT 


4515.1 


4531** 


DAEFRHDSGYEVHHQ 


human M35S 


4471.0 


4472 


DAEFRHDSGY 


human M35L 


4497,1 


4498 


DAEFRHDSGY 


human M35Q 


4512.0 


4512 


not tested 


human M35E 


4513,0 


4512 


not tested 


ratWT 


4417.0 


4435** 


DAEFGHDSGF 



** Hie methionines of human WT and rat WT were transfomied during the cleavage procedure to methionine sulfoxide. 



45 Reference Example 2 

Purification of monomeric 1-42 bA4 
Small scale method 

[0038] Samples containing 1 mg of bA4 were applied to a LKB UltroPac HPLC column (diameter: 7.5 mm, length : 
600 mm, flow 0.5 mlAnln, buffer 12mM Tris(hydroxymethyl)aminomethan containing 200 mM glycine, pH 7.8). bA4 
emerged in a sharp peak, and the peak fractions contained the Coomassie blue band on the agarose electrophoresis 
gel shown in Figure 7. Light-scattering experiments gave no evidence of high molecular weight forms (aggregates or 
55 , fibres) and calibration standards which were chromatographed under Identical conditions pointed to a molecular weight 
of about 4500, indicating that bA4 was present as a monomer 
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Preparative scale method 

[0039] The "Continuous Eiution Electrophoresis system Uodel 491 Prep Cell" from BioRad was used. The non- 
denaturating discontinuous acrylamide gel was composed of a 4 % acryiamide /2.7 % N,N'-methylene-bis-acrylamlde 
5 separating gel in 0.375M Tris(hydroxymethyl) amino-methan (pH8,8) with a length of 3 cm and a diameter of 37 mm, 
and equipped with a cooling tube of 20 mm diameter. 

[0040] The stacking gel was composed of 4 % acrylamide / 2.7 % N,N'-methylene-bis-acrylamide In 0,125 M Tris 
(hydroxymethyl)amino-methan (pH 6.8) and had a length of 2 cm. The running buffer was 26 mM 
Tris(hydroxymethyl)aminomethan / 0.2 M glycine, pH 8.3 . For eiution the same buffer was used with a flow rate of 
10 0.75 ml/minutes. 45 mg of bA4 peptide, generally from wash 2, 3 or 4, were dissolved in 7 ml H2O and 1 ml glycerole. 
Electrophoresis was performed at 12 Watt (constant, limits at 500 V and 40 mA), a typical run requiring 4 hours. The 
results of a typical run are shown In Figure 7. 

Example 5 

15 

Purification of HIV-1 envelope peptide 

[0041] 400 mg fusion protein MRGS (Hjg GS {NANP)^q RSM RILA VERYLKDQQLLGIWGCSGKLICTTAVPWNAS 
(Seq ID No:1 2) prepared and purified by the same methods as those described for the preparation and purification of 
20 the fusion proteins containing monomeric 1 -42 bA4 (Examples 1 - 4) were loaded onto a Vydac RP 1 8 column (spec- 
ification 21 8 TP 1 5201 0, 250 mm x 1 0 mm) and cleaved using the same conditions as those described for the cleavage 
of the 1 - 42 bA4 fusion proteins (Reference Example 1 ). Approximately 20 - 30 mg of lyophilised powder were obtained. 
This powder was analysed by electron spray analysis. A peak of 3902 ± 2 Da corresponding to the HIV-1 envelope 
peptide (Seq ID No:11) was detected. 

25 

SEQUENCE LISTING 
[0042] 

30 (1) GENERAL INFORMATION: 

(I) APPLICANT 

(A) NAME: F. HOFFMANN-LA ROCHE AG 
35 (B) STREET: Grenzacherstrasse 124 

(C) CITY: Basle 

(D) STATE: BS 

(E) COUNTRY: Switzeriand 

(F) POSTAL CODE (ZIP): CH.4002 
40 (G) TELEPHONE: 061 - 688 42 56 

(H) TELEFAX: 061 - 688 1 3 95 

(I) TELEX: 962292/965542 hirch 

(II) TITLE OF INVENTION: Process for producing hydrophobic polypeptides, proteins or peptides 

45 

(ill) NUMBER OF SEQUENCES: 1 2 
(iv) COMPUTER READABLE FORM: 

50 (A) M EDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentin Release #1 .0. Version #1 .25 (EPO) 

S5 (2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 42 amino acids 
{B)TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(V) FRAGMENT TYPE: N-temilnal 

(xl) SEQUENCE DESCRIPTION: SEQ ID NO: 1 : 

Asp Ala Glu Phe Arg His Asp Ser Gly Tyr Glu Val His His Gin Lys 
1 5 10 15 

Leu Val Phe Phe Ala Glu Asp Val Gly Ser Asn Lys GlyAla lie lie 
20 25 30 

Gly Leu Met Val Gly Gly Val Val Be Ala 
35 40 

(2) INFORMATION FOR SEQ ID NO: 2: 

(I) SEQUENCE CHARACTERISTICS: . 

(A) LENGTH: 42 amino acids- 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(V) FRAGMENT TYPE: N-temninal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Asp Ala Glu Phe Gly His Asp Ser Gly Phe Glu Val Arg His Gin Lys 
15 10 15 

Leu Val Phe Phe Ala Glu Asp Val Gly Ser Asn Lys Gly Ala lie lie 
20 25 30 

Gly Leu Met Val Gly Gly Val Val lie Ala 
35 40 

(2) INFORMATION FOR SEQ ID NO: 3: 
(1) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 42 amino acids 
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(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(li) MOLECULE TYPE: peptide 

(V) FRAGMENT TYPE: N-termlnal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Asp Ala Glu I^e Azg His Asp Ser GI7 lyr Glu Val His His Gin Lys 
15 10 15 

Leu Val Phe Phe Ala Glu Asp Val Gly Ser Asn Lys Gly Ala He lie 
20 25 30 

Gtjr Leu Leu Val Gly Gly Val Val He Ala 
35 40 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 anilno acids 

(B) TYPE: annino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(V) FRAGMENT TYPE: N-temiinal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Asp Ala Glu Phe Arg His Asp Ser Gly Tyr Glu Val His His Gin Lys 
1 5 10 15 



Leu Val Phe Phe Ala Glu Asp Val Gly Ser Asn Lys Gly Ala lie ne 
20 25 30 

Gly Leu Ser Val Gly Gly Val Val ne Ala 
35 40 



(2) INFORMATION FOR SEQ ID NO: 5: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 42 amino acids 

(B) Pr'PE: amino acid 
(D) TOPOLOGY: linear 

(II) MOLECULE TYPE: peptide 

(V) FRAiSMENTTYPE: N-temnlnal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Asp Ala GIu Phe Arg His Asp Ser G7y Tyr Glu Val His His Glix Lys 
1 5 10 15 

Leu Val Phe Phe Ala Glu Asp Val Gly Ser Asn Lys Gly Ala He lie 
20 25 30 

Gly Leu Gin Val Gly Gly Val Val ne Ala 
35 40 

(2) INFORMATION FOR SEQ ID NO: 6: 

(I) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(II) MOLECULE TYPE: peptide 
(V) FRAGMENT TYPE: N-temilnal 

(xl) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Asp Ala Glu Phe Arg His Asp Ser G^ Glu Val His His Gin Lys 
1 5 10 15 

Leu Val Phe Phe Ala Glu Asp Val Gly Ser Asn Lys Gly Ala He ne 
20 25 30 

Gly Leu Glu Val Gly Gly Val Val He Ala 
35 40 

(2) INFORMATION FOR SEQ ID NO: 7: 
(i) SEQU ENCE CHARACTERISTICS: 
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(A) LENGTH: 520 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDN ESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAIVIE/KEY: CDS 

(B) LOCATION: 115..516 

(D) OTHER INFORMATION: /products "Amyloid Protein AA" 
(xl) SEQUENCE DESCRIPTION: SEC ID NO: 7: 

CTCGAGAAAT CATAAAAAATTTATITGCTrTGTGACCGGATAAC^ 
60 
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AITGTGAGCG GATAACAATT TCACACAGAA TTCATTAAAG AGGAGAAATT AACT ATG 
117 

Met 
1 

AGAGGATCGCATCACCATCACCATCACGGAl^ 165 
Aig Gly Ser His His His His His His Gly Ser Asn Ala Asn Fro Asn 
S 10 15 

GCG AAC CCG AAC GCG AAC COG AAC GCG AAC COG AAC CCG AAC CCXJ AAC 
213 

Ala Asa Pro Asa Ala Asn Pro Asn Ala Asn Pro Asn Ala Asn Pro Asn 
20 25 30 

GCG AAC COG AAC GCG AAC CCG AAC GCG AAC COG AAC GCG AAC CCG AAC 
261 

Ala Asn Pro Asa Ala Asa Pro Asn Ala Asn Pro Asa Ala Asn Pro Asa 
35 40 45 

GCG AAC CX:G AAC GCG AAC CCG AAC GCG AAC CCG AAC GCG AAC CCG AAC 

309 

Ala Asa Pro Asa AlaAsa Pto Asn Ala Asn Pro Asa Ala Asn Pro Asn 
50 55 60 65 

GCG AAC CCG AAC GCG AAC CCG AAC GCG AAC CCG AAC GCG AAC CCG AAC 

357 

Ala Asn Fro Asn Ala Asn Pro Asn Ala Asn Pro Asa Ala Asn Pro Asn 
70 75 80 

GCG AAC CCG AAC GCG AAC CCG AGATCT ATG GAT GCG GAG TIC GGA CAT 405 
Ala Asn Fro Asn Ala Asn Pro Arg Ser Met Asp Ala Glu Phe GI7 His 
85 90 95 

GATTCAGGCTTCGAAGTCCGCCATCAAAAACTGGTGTrCTITGCAGAA 453 
Asp Ser Gly Phe Glu Val Aig His Gin tys Leu Val Fhe Phe Ala Clu 
100 105 110 

GAT GTC GGTTCA AAC AAA GGT GCC ATC ATT GGA CTC ATG GTG GGT GGC 501 
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Asp Val Gly Ser Asn Lys Gly Ala He lie Gly Leu Met Val Gly Gly 
115 120 125 

GITGTCAIAGCATAAGCTr 523 

ValVaineAla 

130 



(2) INFORMATION FOR SEQ ID NO: 8: 
(!) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 133 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(li) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 



EP 0 641 861 B1 



Met ^ Gl)r Ser His His His His His His Gly Ser AsQ Ala Pro 
15 10 IS 

20 25 30 

Asn Ala Asa Pxo Asa Ala Asn Pm Asn Ala Asn Ita Asa iUa Astt Pto 
35 40 45 

Asn Ala Asa Pro Asa Ala Asn Pro Asn Ala Asn Pro Asa Ala Asn Pro 
SO 55 60 

Asa Ala Asa Pro Asa Ala Asa Pro Asa Ala Asa Pro Asa Ala Asn Fro 
65 70 75 80 

Asa Ala Asa Pro Asa Ala Asa Pro AigSer Met Asp Ala GIu PheGIy 
85 90 95 

His Asp Ser Gly Phe GIu Val Arg His Gla Lys Leu Val Phe Phe Ala 
100 105 110 

GIu Asp Val Gly Ser Asa Lys Gly Ala lie lie Gly Leu Met Val Gfy 
115 120 125 

GlyValVaineAla 
130 



(2) INFORMATION FOR SEQ ID NO: 9: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 520 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(il) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 115,-516 
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(D) OTHER INFORMATION: /product= "Amyloid Protein AA" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

CTCGAGAAAT CATAAAAAAT TXATTTGCrT TGTGAGCGGA TAACAATIM'iWarAGATrCA 
60 

ATTGTGAGCG GAIA\CAATTTCACACAGAAT^ 
117 

- Met , 

1. 

AGA GGATCO CAT CAC CAT CAC CAT CAC GGATCT AAC GCG AAC CCO AAC 165 
Acg Gly Ser His Kls His His His His Gly Ser Asn Ala Asn Pro Asn 
5 10 15 

COG AAC GCG AAC GCG AAC GCG AAC GCG AAC GCG AAC GCG AAC COG AAC 
213 

Ala Asn Pro Asn Ala Asn Pro Asa Ala Asn Pro Asn Ala Asn Pro AsQ- 
20 25 30 

GCG AAC CCC AAC GCG AAC CCG AAC GCG AAC CCG AAC GCG AAC CCG AAC 
261 



17 



EP 0 641 861 B1 



AlaAsnPxD AsnAlaAsnPro AsnAla Asn Pro AsnAlaAsnProAsn 
35 40 45 

GCG AAC CCG AAC GCG AAC COG AAC COG AAC COG AAC GCG AAC CCG AAC 
309 

Ala Asxi Pro Ash Ala Asn Pro Asn Ala Asn Pro Asn Ala Aszi Pro Asn 
50 55 60 65 

GCG AAC CCG AAC GCG AAC CCG AAC GCG AAC CCG AAC GCG AAC CCG AAC 
357 

Ala Asn Fro Asn Ala Asn Pro Asn Ala Asn Pro Asn Ala Asn Pro Asn 
70 75 80 

GCG AAC CCG AAC GCG AAC CCG AGATCT ATG GAT GCG GAG TIC CGT CAT 405 
Ala Asn Pro Asn Ala Asn Pro Arg Ser Met Asp Ala Glu Fbe Aig His 
85 90 95 

GAT TCA GGC TAT GAA GTC CAC CAT CAA AAA CTG GTG TIC TIT GCA GAA 453 
Asp Ser G|/iyr Glu Val Kls His GlnLys Leu Val Phe Phe Ala Glu 
100 105 110 

GAT CTG GGTTCA AAC AAA GGT GCC ATC ATT GGA CTC ATG GTG GGT GGC 501 
Asp Val Gly Ser Asn Lys Gly Ala ne He GI7 Leu Met Val Gly GI7 
115 120 125 

GTrGTCATAGCATAAGClT 520 

ValVaineAla 

130 

(2) INFORMATION FOR SEQ ID NO: 10: 
(I) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 133 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
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Met Axg Ser His His His HIS His His Gly Ser Asn Ala Asn Pro 
1 5 10 15 

Asn i\Ia Asn Pro Asn Ala Asn Pro Asn Ala Asn Pro Asn Ala Asn Pro 
20 25 30 

Asn Ala Asn Pro Asn Ala Asn Pro Asn Ala Asn Pro Asn Ala Asn Pro 
35 40 45 

Asn Ala Asn Pro Asn Ala Asn Pro Asn Ala Asn Pro Asn Ala Asn Pro 
SO 55 60 

Asn Ala Asn Pro Asn Ala Asn Pro Asn Ala Asn Fro Asn Ala Asn Pro 
65 70 75 80 

Asn Ala Asn Pro Asn Ala Asn Pro Aig Ser Met Asp Ala GIu Phe Axg 
85 90 95 

His Asp Ser Gly Tyr Glu Val His His Gin Lys Leu Val Phe Phe Ala 
100 105 110 

Glu Asp Val GI/ Ser Asn Lys Gly Ala ne ne Gly Leu Met Val Gly 
115 120 125 

Gly Val Val ne Ala 
130 

(2) INFORMATION FOR SEQ ID NO: 11 : 
(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) IViOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
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Aig De Leu Val Glu Arg lyr Leu Lys Asp Gin Gin Leu Leu GI/ 
15 10 15 

He Trp Giy Cys Ser Gfy Lys Leu fle Cys Thr Hir Ala Val Pro Trp 
20 25 30 

AsnAlaSer 
35 

(2) INFORMATION FOR SEQ ID NO: 12: 
0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 126 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(li) MOLECULE TYPE: protein 

bd) SEQUENCE DESCRITOON: SEQ ID NO: 12: 

Met Aig Gfy Ser His His HJs His His His Gly Ser Asn Ala Asn Fro 
1 5 10 IS 

Asn Ala Asn Pro Asn Ala Asn P» Asn Ala Asn Pro Asn Ala Asa Pro 
20 25 30 

Asa Ala Asn Pro Asn Ala Asn Pro Asn Ala Ash Pro Asn Ala Asn Pro 
35 40 45 

Asn Ala Asn Pro Asn Ala Asn Pro Asn Ala Asn Pro Asn Ala Asn Pro 
50 55 60 
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Asn Ala Asn Pro Asn Ala Asn Pro Asn Ala Asn Pro Asn Ala Asn Pro 
65 70 75 80 

Asn Ala Asn Pro Asn Ala Asn Pro Ajg Ser Met Aig ne Leu Ala Val 
85 90 95 

GIu Arg Tyr Leu Lys Asp Gin Gin Leu Leu Glyr lie Trp Gly Cys Ser 
100 105 110 

Giy Lys Leu lie Cys Thr Thr Ala Val Pro Asn Ala Ser 
115 120 125 



Claims 

1 . A fusion protein of the formula: 

A-B-C 

wherein A is a bull<y hydrophilic peptide wherein the bulky hydrophilic peptide has a peptide sequence of the fomnula 

(NANP)x 

wherein x is 1 0-40, preferably 1 9, B is a selected cleavage site and C is a desired hydrophobic polypeptide, protein 
or peptide. 

2. A fusion protein In accordance with claim 1 , wherein the selected cleavage site Is a chemical cleavage site, pref- 
erably a methionine residue which site is specificaily cleaved by cyanogen bromide. 

3. ' A fusion protein in accordance with claim .1 or 2, wherein the desired hydrophobic peptide is the HIV-1 envelope 
peptide with [Seq ID No: 11]. 

4. Genes which code for a fusion protein in accordance with any one of claims 1 -3. ■ 

5. Expression vectors in which a gene in accordance with claim 4 Is operativety linked to an expression control se- 
quence. 

6. A bacterium transformed with an expression vector In accordance with claim 5. 

7. A bacterium as claimed in claim 6 which is E.coli. 

8. A process for production and purification of a desired hydrophobic polypeptide, protein or peptide, which process 
comprises the steps of : 

a) passing an aqueous solution containing fusion protein In accordance with any one of claims 1 -3 through a 
hydrophobic matrix column. 
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b) flushing the column with a solution containing a cleavage reagent or an enzyme, and 

c) removing the resulting desired hydrophobic polypeptide, protein or peptide with an aqueous water misctble 
solvent. 

5 

9. A process in accordance with claim 8, wherein the hydrophobic matrix column is an octadecyl bound silica micro- 
particle column. 

10. A process in accordance with claim 8 or 9, wherein the cleavage reagent is cyanogen bromide. 

10 

11 . A process in accordance with claims 8-10 wherein the desired hydrophobic peptide is the HIV-1 envelope peptide 
with [Seq ID No. 11]. 

12. The use of a fusion protein in accordance with claims 1 - 3 for the production and purification of the desired hy- 
15 drophobic polypeptide, protein or peptide C. 



Patentansprtiche 

20 1 . Fusionsprotein der Formel A-B-C, worin A ein sperriges hydrophiles Peptid ist, wobei das sperrige hydrophile 
Peptid eine Peptidsequenz der Fontiel (NANP)^ hat, worin x 10 bis 40, bevorzugt 19 ist, B eine ausgewahlte 
Schnittstelle ist und C ein gewunschtes hydrophobes Polypeptid, Protein oder Peptid ist. 

2. Fusionsprotein nach Anspruch 1 , wobei die ausgewahlte Schnittstelle eine chemische Schnittstelle Ist, bevorzugt 
25 ein Methioninrest, die spezifisch durch Bromcyan gespalten wird, 

3. Fusionsprotein nach Anspruch 1 oder Anspruch 2, wobei das gewQnschte hydrophobe Peptid das HIV-1 -Envelope- 
Peptid mlt [SEQ ID Nr 11] Ist. 

30 4. Gene, die ein Fusionsprotein nach einem der Anspriiche 1 bis 3 codieren. 

5. Expressionsvektoren, bei denen ein Gen gemalB Anspruch 4 operativ mit einer Expressionskontrollsequenz ver- 
bunden ist. 

Bakterium, das mit einem Expressionsvektor nach Anspruch 5 transformiert ist. 
Bakterium nach Anspruch 6, das £. coli Ist. 

Verfahren zur Herstellung und Reinigung eines gewQnschten hydrophoben Polypeptids, Proteins oder Peptids, 
wobei das Verfahren die Stufen umfasst, dass: 

a) eine wassrige Losung, die ein Fusionsprotein nach einem der Anspruche 1 bis 3 enthalt, uber eine Saule 
mit einer hydrophoben Matrix gelertet wird, 

b) die Saule mit einer Losung gespult wird, die ein Spaltreagenz oder ein Enzym enthalt und 

c) das entstehende gewQnschte hydrophobe Polypeptid, Protein oder Peptid mit einem wSssrlgen wasser- 
mischbaren Ldsungsmittel entfemt wird. 

Verfahren nach Anspruch 8, wobei die Saule mit hydrophober Matrix eine Silbiumdioxidmikroteilchensaule mit 
gebundenen Octadecyigruppen ist. 

Verfahren nach Anspruch 8 oder Anspruch 9, wobei das Spaltreagenz Bromcyan Ist. 

Verfahren nach einem der Anspruche 8 bis 10, wobei das gewunschte hydrophobe Peptid MIV-1-Envelope-Peptid 
mit [SEQ ID Nr. 11] ist. 

Verwendung eines Fusionsproteins nach einem der Anspriiche 1 bis 3 zur Herstellung und Reinigung des ge- 
wunschten hydrophoben Polypeptids, Proteins oder Peptids C. 
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Revendlcations 

1 . Prot^ine de fusion de formute 

A-B-C 

dans laquelie 

A est un peptide hydrophile volumineux, le peptide hydrophile volumineux ayant une sequence peptidique de 
fomriule 

(NANP)x 

dans laquelie x est un nombre de 10 d 40, de pr^f^rence 1 9, 
B est un site de cllvage cholsl et 

C est un polypeptide, une prot^ine ou un peptide hydrophobe d^lr§(e). 

2. Proteine de fusion selon la revendication 1 , dans laquelie le site de clivage choisi est un site de clivage chimique, 
de preference un residu de methionine, ce site 6tant olive de fa9on specifique par du bronnure de cyanog^ne. 

3. Proteine de fusion seion la revendication 1 ou 2. dans laquelie le peptide hydrophobe d4sir6 est le peptide d'en- 
veloppe de VIH-1 ayant la sequence [SEQ ID 1 1]. 

4. Gfenes qui codent pour une proteine de fusion selon Tune quelconque des revendlcations 1 k 3. 

5. Vecteurs d'expression dans lesquels un gene seton la revendication 4 est lie de fa^on operation nelle k une se- 
quence de contrdle de I'expresslon. 

6. Bacterie transformee avec un vecteur d'expression selon la revendication 5, 

7. Bacterie selon la revendication 6, qui est E. colL 

8. Precede de production et de purification d'un polypeptide, d'une proteine ou d'un peptide hydrophobe desire(e), 
ce precede comprenant les etapes selon lesquelles; 

(a) on fait passer une solution aqueuse contenant une proteine de fusion selon I'une quelconque des reven- 
dlcations 1 d 3 d travers une colonne de matrice hydrophobe, 

(b) on rince la colonne avec une solution contenant un reactif de clivage ou une enzyme, et 

(c) on separe le polypeptide, la proteine ou le peptide hydrophobe desire(e) obtenu(e) avec un solvant aqueux 
miscible ^ I'eau. 

9. Precede seion la revendication 8, dans lequel la colonne de matrice hydrophobe est une colonne de micropartlcules 
de silice ayant des groupes octadecyte lies. 

10. Precede selon la revendication 8 ou 9, dans lequel le reactif de clivage est le bromure de cyanogene. 

11. Precede selon les revendlcations 8^10, dans lequel le peptide hydrophobe desire est le peptide d'enveloppe de 
VIH-1 ayant la sequence [SEQ ID N» 11]. 

1 2. Utilisation d'une proteine de fusion selon les revendlcations 1 k 3 pour la production et la purification du polypeptide, 
de la proteine ou du peptide hydrophobe C desire(e). 
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Ftgare2 



bla 



N250PSN250P29 
RBSII 

[His]6 



Xhol 



EcoRi [NANP]19 
amy 



Hindlll 




to 



cat 



repl. 
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Figures 



Xhol 

1 CTCGAGAAATCATAAAAAATTTATTTGCTTTGTGAGCGGATAACAATrATAATAGATTCA 

EcoRI 

61 ATT6T6AGCGGATAACAATTTCACACAGAATTCATTAAAGA6GAGAAATTAACTATGAGA 

MetArg 

121 GGATCGCATCACCATCACCATCACGGATCTAACGCGAACCCGAACGCGAACCCGA^CGCG 
QlySerHisHisHlsHlsHisHisGlySerAsxiAlaAsnProAsnAlaAsnProAsnAIa 



181 AACCCGAACGCGAACCCGAACGCGAACCCGAACGC6AACCCGAACGCGAACCCGAACGCG 
AsnProAsnAlaAsnProAsxiAlaAsnFroAsnAlaAsnProAsnAlaAsnProAsnAla 



241 AACCCGAACGCGAACCCGAACGCGAACCCGAACGCGAACCCGAACGCGAACCCGAACGCG 
AsnProAsziAlaAsnProAsnAlaAsnProAsnAlaAsnProAsnAlaAsnProAsnAla 



301 AACCCGAACGCGAACCCGAACGCGAACCCGAlkCGCGAACCCGAACGCGAACCCGAACGCG 
AsnProAsnAlaAsnProAsnAlaAsnProAsnAlaAsnProAsnAlaAsnProAsnAla 

Bglll 

361 AACCCGAACGCGAACCCGAGATCTATGGATGCGGAGTTCGGACATGATTCAGGCTTCGAA 
AsnProAsnAlaAsnProArgSerMetAspAlaGluPheGlyHisAspSerGlyPheGlu 

1 11 

421 GTCCGCCATCAAAAACTGGTGTTCTTTGCAGAAGATGTGGGTTCAAACAAAGGTGCCATC 
ValArgHisGlnLysLeuValPhePheAlaGluAspValGlySerAsnLysGlyAlalle 

21 . 31 

Hlndlll 

481 ATTGGACTCATGGTGGGTGGCGTTGTCATAGCATAAGCTT 520 (Seq« ID No: 7) 
IleGlyLeuMetValGlyGlyValVallleAla (Seq. ID No: 8) 

41 
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figure 4 
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Figure 5 

Xhol 

1 CTCGAGAAATCATAilAAAATTTATTTGCTTTGTGAGCGGATAZlC^TTATAATAGATTCA 

EcoRI 

61 ATTGTGAGCGGATAACAATTTCACACA6AATTCATTAAAGAGGAGAAATTAACTATGAGA 

MetArg 



121 GGATCGCATCACCATCACCATCACGGATCTAACGCGAACCCGAliCGCGAACCCGAACGCG 
GlySerHlsHisHisHisHlsHisGlySerAsnAlaAsnProAsnAlaAsxiProAsnAla 



181 AACCC6AACGCGAACCCGAACGCGAACCCGAACGCGAACCCGAACGCGAACCCGAACGCG 
AsnProAsnAlaAsnProAsnAlaAsnFroAsxiAlaAsziFrpAsnAlaAsnProAsnAla 



241 AACCCGAACGCGAACCCGAACGCGAACCCGAACGCGAACCCGAACGCGAACCCGAACGCG 
AsnProAsxiAlaAsnProAsnAlaAsnProAsnAlaAsnProAsnAlaAsnProAsnAla 



301 AACCCGAACGCGAACCCGAACGCGAACCCGAACGCGAACCCGAACGCGAACCCGAACGCG 
AsnProAsnAlaAsnP roAsnAIaAsnProAsnAlaAsnPr oAsnAlaAsnProAsnAla 

Bglll 

361 AACCCGAACGCGAACCCGAGATCTATGGATGCGGAGTTCCGTCATGATTCAGGCTATGAA 
AsnProAsnAlaAsnProArgSerMetAspAlaGluPheArgHisAspSerGlyTyrGlu 

X 11 

421 GTCCACCATCAAAAACTGGTGTTCTTTGCAGAAGATGTGGGTTCAAACAAAGGTGCCATC 
ValHisHisGlnLysLeuValPhePheAlaGluAspValGlySerAsnLysGlyAlalle 

21 31 

Hindlll 

481 ATTGGACTCATGGTGGGTGGCGTTGTCATAGCATAAGCTT 520 (Seq.ID No: 9) 

IleGlyLeuMetValGlyGlyValValZleAla (Seq.ID No. 10) 
35 41 



[M35E1: GAG (M35L] : CTG [M35Q] : CAG tM35S] ; TCT 

Glu Leu Gin Ser 

35 35 35 35 
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Figure 7 
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